Explanation of the Gibbs paradox within the framework of quantum thermodynamics 



o 
o 

(N 

in 



(N 
> 

in 

o 
o 
in 
o 

:^ 

■ 

a • 

^ : 



A.E. AUahverdyan^^ and Th.M. Nieuwenhuizen^^ 
Yerevan Physics Institute, Alikhanian Brothers St. 2, Yerevan 375036, Armenia, 
■^^ Institute for Theoretical Physics, University of Amsterdam, 
Valckenierstraat 65, 1018 XE Amsterdam, The Netherlands 

The issue of the Gibbs paradox is that when considering mixing of two gases within classical 
thermodynamics, the entropy of mixing appears to be a discontinuous function of the difference 
between the gases: it is finite for whatever small difference, but vanishes for identical gases. The 
resolution offered in the literature, with help of quantum mixing entropy, was later shown to be 
unsatisfactory precisely where it sought to resolve the paradox. Macroscopic thermodynamics, 
classical or quantum, is unsuitable for explaining the paradox, since it does not deal explicitly 
with the difference between the gases. The proper approach employs quantum thermodynamics, 
which deals with finite quantum systems coupled to a large bath and a macroscopic work source. 
Within quantum thermodynamics, entropy generally looses its dominant place and the target of 
the paradox is naturally shifted to the decrease of the maximally available work before and after 
mixing (mixing ergotropy). In contrast to entropy this is an unambiguous quantity. For almost 
identical gases the mixing ergotropy continuously goes to zero, thus resolving the paradox. In this 
approach the concept of "difference between the gases" gets a clear operational meaning related to 
the possibilities of controlling the involved quantum states. Difficulties which prevent resolutions 
of the paradox in its entropic formulation do not arise here. The mixing ergotropy has several 
counter-intuitive features. It can increase when less precise operations are allowed. In the quantum 
situation (in contrast to the classical one) the mixing ergotropy can also increase when decreasing 
the degree of mixing between the gases, or when decreasing their distinguishability. These points 
go against a direct association of physical irreversibility with lack of information. 

PACS numbers: PACS: 05.70Ln 



I. INTRODUCTION. 

Studying mixtures and mixing processes is one of the 
oldest tasks of thermodynamics. Perhaps the most cel- 
ebrated aspect of this task is the Gibbs paradox: the 
entropy increase upon mixing two different gases stays 
finite for an arbitrary small difference between the gases, 
but is zero for identical gases. 

This paradox is discussed in many textbooks on ther- 
modynamics and statistical physics — e.g., in — and 
it created a vast amount of literature during the last hun- 
dredyears till our days |3i d ^, ^, 2, 1, 1, Iflfc IH E III 
0j El El' It was stated to be of a high princi- 
pal and methodical value since it displays the limits 
of applicability for classical (phenomenological) thermo- 
dynamics: the resolution of the paradox (if any) ought 
to lie outside this discipline. Already several times the 
paradox was claimed to be resolved, but each time it was 
reconsidered and seen as an open issue again. 

The present status of the problem is somewhat contro- 
versial. The existing opinions can be roughly summarized 
as follows. 

(1) The paradox is resolvable within the informa- 
tion theoretical approach already in classical statistical 
physics EE- 

(2) The most natural resolution of the paradox has 
been achieved within quantum statistical physics 
0,HI3 thanks to the feature of partial distinguishability. 

(3) The quantum situation presents a natural setting 
for the resolution, but there is a specifically quantum pe- 
culiarity of the problem (induced by non-commutativity) 



which still prevents its ultimate resolution El- Thus, 
the Gibbs paradox in quantum statistical physics has so 
far not been resolved. 

We share the last opinion. Our purpose is to present 
an explanation of this thermodynamical paradox starting 
from the first principles of quantum mechanics. 

This is the program of quantum thermodynamics, see 
|l8j for a short review. The crucial point in our discus- 
sion of the Gibbs paradox is to realize that it has to be 
formulated in terms of the available work, as was already 
realized by Lande in 1955 '3|. (Within the setup of classi- 
cal thermodynamics this formulation is equivalent to the 
entropic one 6]). In contrast to entropy, the available 
work - by definition an ensemble average - is a well- 
defined quantity for any equilibrium or non-equilibrium 
state even of small quantum systems and it is a function 
of both the state and the class of work sources employed 
for work extraction. Moreover, the features of work are 
grounded directly on the first principles of quantum me- 
chanics. On top of that, the amount of available work 
adequately reflects intuition usually associated with en- 
tropy, such as being a measure of non-equilibrium or dis- 
order ^. 



^ In fact the priority of the available work over entropy was ade- 
quately understood already by Clausius; see discussions in |2d. 
Another example is Schrodinger, who in his famous book |21| . 
gave importance to (neg)entropy for characterizing survival of 
organisms, but later on admitted that he should have spoken in 
this context about the available work rather than entropy. 
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The above first-principles properties lead to the res- 
olution of the paradox in terms of mixing work which 
shows perfectly continuous behavior when the difference 
between the gases goes to zero. Difficulties which pre- 
vent resolutions of the paradox in its entropic formula- 
tion do not arise here. Certain aspects of the proposed 
scheme — using work instead of entropy, time-scale sepa- 
ration, etc. — were already anticipated in literature, e.g. 
in 01 O- However, these anticipations were conceived 
only in the framework of phenomenological thermody- 
namics, and this is why the resolution in terms of mixing 
work was not achieved ^. 

Our paper is organized as follows. In section^ we re- 
call the classical formulation of the Gibbs paradox. Next 
section reviews the mixing entropy argument, an attempt 
to solve the paradox with help of quantum entropy. Sec- 
tion ?? discusses in detail why this argument cannot be 
considered as a resolution of the paradox. Two basic 
reasons for this are outlined and several pertinent is- 
sues are discussed. Section ?? presents the resolution 
of the paradox with help of mixing work. In section ?? 
we discuss the mixing work in the contexts of instru- 
ments available for work-extraction. The analysis fully 
embodies the idea that the difference between two sub- 
stances is first of all an operational notion and should not 
be given any absolute status "2^. Moreover, it appears 
that the dependence of the mixing work on the available 
instruments is non-trivial, since it can both increase or 
decrease upon introducing restrictions on those instru- 
ments. Though the mixing work is zero when mixing 
identical substances, in the quantum situation it can be 
a non-monotonous function of the degree of mixing and 
of the (information-theoretic) distinguishability between 
the mixing substances. These are shown in sections ?? 
and ??, respectively. The last section presents our con- 
clusions. Appendix ?? discusses definitions of entropy 
and their relations to the second law; Appendix ?? rec- 
ollects several formulas. 



of the same substance. The difference is not specified, 
but assumed to be tunable. The number of particles N, 
pressure P and temperature T in each reservoir are the 
same. The entropy S of each gas is Q 

SiN,V)^Nln^ (1) 

Since the gases do not interact, the total entropy reads 

Si^Si{N,V) + S2iN,V)^2Nln^. (2) 

Now remove the wall. The overall system of the two 
gases is assumed to be thermally isolated (the only influ- 
ence of the external fields is in removing the wall) ^ . The 
gases will mix, and after some transient time, a new equi- 
librium state is reached. Since in this state gases still do 
not interact, the final entropy Sf can be obtained again 
as a sum of two partial entropies, every component with 
N particles distributed in the volume 2V, 

2V 

Sf^Si{N,2V) + S2{N,2V)=2N\n— (3) 

Thus the mixing entropy reads 

AS' = S'f-S'i = 2A^ln2. (4) 

The additional contribution 2iVln2 arose due to the ir- 
reversible process of mixing, and it does not depend on 
any quantitative measure of the difference between the 
ideal gases. 

Now consider the same process, but assume that ini- 
tially the gases are identical. After removing the wall, 
Eq. Q does not predict any entropy change. Indeed, in 
the final state we have a one-component gas with total 
number of particles 2N in the volume 2V. Thus, from 
Eq. (P), Sf = 2iVln|^ and this equals S'i, so AS = 0. 
This is, of course, the expected and consistent result. 



II. CLASSICAL FORMULATION OF THE 
GIBBS PARADOX. 

Consider two reservoirs each one having volume V. 
They are separated by a wall and are filled with different 
ideal ^ Boltzmann gases, e.g., with two different isotopes 



^ The to be presented resolution of the Gibbs paradox resembles 
the recent solution for the Maxwell demon problem presented by 
Scully and co-workers |22| : both find their basis in quantum ther- 
modynamics, that is to say, the thermodynamics of small quan- 
tum systems coupled to a macroscopic bath and work source, the 
latter leading to a time-dependent Hamiltonian. Another recent 
result of quantum thermodynamics is our report on the break- 
down of the Landauer i neq uality for the energy needed to erase 
one bit of information ligl . 

^ For simplicity we choose to work with ideal gases. The ideality is 
not an issue for the Gibbs paradox: it exists for non-ideal gases 



as well 01 , and the resolution obtained below for ideal gases will 
be generalizable to the non-ideal situation. 

* In formula (??) we omitted a term N fm{T) with / being some 
function of temperature, e.g., fm{T) = | — ^ In ( ) ^ 
monoatomic gas. This term does not play any role in our dis- 
cusion, since it drops out from the entropy difference. One also 
should not be troubled by the presence of the dimension inside of 
the logarithm in 0, because it is canceled by the one of fm{T), 
while in our further discussion it drops out anyhow when calcu- 
lating entropy differences. 

^ We shall focus on the mixing in the thermally isolated system. 
For ideal gases this coincides with the isothermal mixing, since 
the energy U of such a gas depends only on its temperature: 
U/N = fm{T) - Tf'^{T), where the function fm{T) is discussed 
in Footnote 111 In general (i.e. for non-ideal gases), there will be a 
difference between the isothermal mixing, where the temperature 
is kept constant during the whole process with help of an external 
thermal bath, and the mixing in the thermally isolated system, 
where the final temperature is determined by the constancy of 
the overall energy. During the isothermal mixing the gases will 
exchange some energy with the bath (mixing heat). 
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since there is no irreversibility when mixing two identical 
gases in equilibrium; see in addition below and Footnote 
??. Thus we have arrived at the Gibbs paradox |^^|^: 

6 

• When varying continuously the difference between 
the gases, the entropy defined according to Eqs. ^ 
13 changes discontinuously. 

It is to be stressed that the existence of the Gibbs 
paradox is not connected with the thermodynamic limit 
N ^ oo. As discussed in the finite- situation 

does bring some differences in the expression for the en- 
tropy of mixing, but the paradox survives; see in addition 
Footnote ??. 



One of the main points in taking the internal states 
into account is that now from the very beginning we can 
treat the two gases as identical, but being in different 
internal states pi and P2 0) S [l^- This is similar to 
what happens in nuclear physics, where the neutron and 
proton are considered as identical particles (nucleons) in 
different states distinguished by the value of the isotopic 
spin. 

(b) After removing the wall, the gases mix. We shall 
assume that the time-scale on which the internal states 
of the gases change is much larger than the time-scale 
related to mixing of the translational degrees of freedom. 

(c) Thus after the mixing, the internal states will be 
described by the density matrix {M ~ 2) 



III. MIXING ENTROPY ARGUMENT. 



M 



(6) 



A. Assumptions of the argument. 

It was realized by many scholars that the origin of the 
paradox is that the difference between the gases is only 
assumed, but does not show up explicitly in Eqs. 1^- 
01, i.e., the description that led to the discontinuity is 
not sufficiently complete 0, H S ■ In that respect the 
paradox demonstrates the limits of applicability of phe- 
nomenological thermodynamics. 

It is expected that for two ideal gases the difference 
will be related to the internal states of their atoms |^ 0, 

(a) Indeed, besides the translational motion which 
contributes to the entropy (??), the atoms of the gases 
also have internal states (e.g., spin states). These states 
are typically described by quantum mechanics. For 
Boltzmann gases the internal states of the atoms are de- 
coupled from the translational motion. Returning to the 
above example of different gases in two reservoirs, let us 
assume that the first and second reservoirs contain atoms 
in internal states described by density matrices 



Pi 



and 



P2, 



(5) 



respectively 



Since two equal amounts of gases are mixed, the proba- 
bility (weight) factors are equal, Ai = A2 = ^. 

The same Eq. ^ applies for the mixing of M gases 
with number of particles {Na}^^i and initial density ma- 
trices {pa}^=i', the corresponding weights are 



A^^ 



a = 1, ...,M. 



(7) 



For the details of this generahzation see section ??. 

B. Implementation of the argument 

Due to the above decoupling feature, the total entropy 
of the translational motions and the internal states of 
each gas is defined as [recall Footnote 0] 

Sk{N,V)= Nlii^+NS^^ipk), fc=l,2, (8) 
S'vn(p) = -tr [plnp] , (9) 

where p^ are given by ©, and where S'vn(p) is the von 
Neumann entropy. 

The initial entropy of the two gases is the sum of two 
contributions (recall that Ni = N2 = N) 



° The paradox is not always formulated correctly in literature; see 
13 for detailed criticism. Some authors define entropy as Af In y 
and see the paradox in increasing the entropy when mixing two 
identical gases. Others think that the paradox is resolved by the 
very fact of not having any entropy increase when mixing iden- 
tical gases. To avoid confusion, we stress that the paradox is in 
the discontinuous change of entropy when tuning the difference 
between the gases. A closely related point — which can also be 
viewed as paradoxical — is that the mixing entropy does not de- 
pend on the actual difference between the gases, provided this 
difference is not zero. 

Recall that the density matrix — as well as the wave func- 
tion — refers to an ensemble of identically prepared systems; see, 
^•g-i Ql' Thus by "state of a particle" we necessarily mean the 
density matrix of the ensemble to which this particle belongs. 



Si^2N\ii^+NS,^ipi) + NSMP2), (10) 



while the final entropy reads 
2V 



Sf^2Nln^ + 2NSMp)- 



(11) 



Recall that we treat two gases as identical; so in the final 
state there is a single gas having 2N particles in volume 
2V. The mixing entropy AS* = S{ — Si thus reads: 



AS* = 2N 



S'vn(p) - 2'5'vn(pi) - -^Svn{p2) 



(12) 
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Assume that the internal states were maximahy differ- 
ent, i.e., orthogonal. 



P1P2 



0. 



(13) 



Such states can be distinguished by a single measure- 
ment, i.e., if it is known that the state of a given single 
atom belongs to an ensemble described by either pi or p2, 
then a single measurement suffices to establish the iden- 
tity of the state. In this respect orthogonal states are 
similar to the classical case (perfect distinguishability). 
It is seen from definitions ??) that 



'5'vn(p) = -tr 



tr 


Pi 


In^l 


- tr 


'P2 


In^l 




. 2 


2 J 




. 2 


2 J 



(14) 



and that the mixing entropy A^* — 2iVln2 agrees with 
the prediction of classical thermodynamics. 

The other extreme is when the states are identical. 



Pi = P2, 



(15) 



C. Generalization to several mixing gases. 

We shall indicate how (??) changes for the mixing of 
two gases having initially non-equal number of particles 
and non-equal volumes. The generalization to the mixing 
of several gases will be straightforward. 

Let the first and second resevoirs contain, respectively, 
iVi and N2 particles in volumes Vi and V2. Since we 
are interested in irreversibilities coming due to mixing 
only, we should assume that the initial pressures P and 
temperatures T of the two gases are equal both initially 
and finally. The known ideal-gas relation PV ~ NT, 
applied for V ^ Vi, V2, Vi + V2 and N = Ni, N2, Ni + 
N2, implies 



P 
T 



Vi 



N2 _ N1+N2 
V2 " V1+V2' 



(20) 



Using (??) and proceeding along the same lines as 
when deriving (??), we get for the mixing entropy {M = 
2) 



which implies AS' = again in agreement to the predic- 
tion of classical thermodynamics. 

In general, if neither (??) nor (??) is true, the states 
pi and p2 are only partially distinguishable, i.e., any fi- 
nite number of measurements will distinguish these states 
with a finite error. Assume the states are pure: 



pi = |ai)(ai| and 
Noting the spectrum 



P2 = |a2M«2| 



(16) 



Spec { i|ai)(ai| + ^\a2){a2\\ = ^ (1 ± |(ai|a2)|) , (17) 



we get from (??) 



AS 



M 



M 



SvN ( ^ AqPq ] — ^ ^aSv-M{Pa), (21) 



where 



a = 1, ..,M. 



(22) 



are the fractions of the two gases in the final density 
matrix. We already wrote Eqs. (??-??) such that they 
hold for any M >2. 



IV. CRITIQUE OF THE QUANTUM MIXING 
ENTROPY ARGUMENT. 



^.-tr[plnp]^.fl-K^^ 
h{x) = —a; In a; — (1 — a;) ln(l — x). 



(18) 
(19) 



This expression is minimal, and equal to zero for identical 
gases I (ai \a2) | = 1. It is maximal and equal to 2N In 2 for 
totally distinguishable (orthogonal) states |(ai|a2)| = 0. 
In the intermediate case < |(ai|a2)| < 1, AS* changes 
continuously, a conclusion that holds more generally [3. 
This was seen as a resolution of the Gibbs paradox [y, 0, 
^. We shall recall counter arguments in section 



Note that there are several differences between the positions un- 
dertaken by the authors of 0, |3, |3 versus the one of Lande in 
12 . The detailed analysis carried out in |3 suggests that the 
approach by Lande contains errors, and his final formulas for the 
entropy of mixing are different from those in 



A. Thermodynamic entropy of mixing is ill-defined 
in quantum mechanics. 

The above argument on the continuous change of AS 
was seen by many as the resolution of the Gibbs paradox 
- and it is often still believed to be. However, a more 
detailed analysis has shown that this explanation creates 
a new conceptual problem . Let us recall the following 
features of the thermodynamical entropy: 

• If two states A and B are connected by an irre- 
versible process A ^ B, then for defining thermo- 
dynamically the entropy change during this pro- 
cess, we should connect those states by a certain 
reversible process A B — possibly by involving 
thermal baths and sources of work — and calculate 
the entropy change AS via the Clausius formula 



AS(A^B)./ f 



(23) 
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where dQ and T are, respectively, the differential 
heat (received from thermal baths) and the tem- 
perature. 

Eq. (??) provides entropy with an operational 
meaning and makes it observable via macroscopic 
measurements. Indeed, determining, e.g., the von 
Neumann entropy via its definition (??) implies 
knowledge of all eigenvalues of the corresponding 
density matrix p. This knowledge is not available 
for typical macroscopic or mesoscopic systems. 

• A reversible process A ^ B is defined by requiring 
that it is possible to pass back along the same tra- 
jectory and to return to the same thermodynamical 
state ^, such that, in particular, the entropy change 
during the resulting cyclic process A ^ B ^ A is 
equal to zero: Xv^b^a ^ 

Any statistical definition of entropy is expected to 
agree with the above thermodynamical one. An in- 
spection shows, however, that this is not the case 
[lj|: the partially distinguishable (i.e., non-orthogonal) 
states — which were supposed to solve the paradox — cre- 
ate in this respect an inconsistency. It appears that for 
such states there is no reversible mixing process. Let 
us first of all note that when the internal states pi and 
P2 are orthogonal — that is, they correspond to definite 
eigenvalues oi and 02 of some physical observable (her- 
mitean operator) A — it is possible to separate the mixed 
gases, and at least in principle to fulfil the requirement 
of a cyclic process. What one needs for this purpose is a 
suitable Hamiltonian 

ff,ep-/(r,A), (24) 

which establishes strong correlations between the inter- 
nal states of the atoms and their translational motion de- 
scribed by the position vector f (24] : the function f{r, a^) , 
with i — 1,2, is very small for r being in, respectively, 
first and second reservoirs. The magnitude of iJsop has 
to be sufficiently large, so that all other terms in the 
overall Hamiltonian can be neglected. Together with a 
low temperature bath, weakly coupled to the gases, the 
Hamiltonian Hgcp will drive the system towards its min- 
ima and it will separate the mixed gases back into differ- 
ent reservoirs 24J without changing the internal states of 
the atoms (since [Hscp,A] = 0). There can be practical 
limitations on this procedure related, e.g., with limita- 
tions on the magnitude of iJsop, but in principle such a 
process is possible. Thus, one can apply (??) and recover 
of the usual thermodynamical formulas for entropy [2^. 

The problem is that once the gases described by ini- 
tially partially distinguishable (non-orthogonal) density 



Thermodynamical state is defined by the values of certain macro- 
scopic quantities, such as pressure, temperature, magnetization, 
entropy, etc. 



matrices pi and p2 (state A) are mixed with weights Ai 
and A2, respectively (state B), then it is impossible to 
go back to the original state by any process such that 
the two gases return to their original states: There is no 
Hamiltonian similar to i?scp in (??) which can achieve 
such a separation |24|| . in particular because pi and p2 
do not form eigenstates of any hermitian operator. 

Are there, however, measurements which can help to 
achieve this separation? We need a careful discussion 
of this question, since the existing opinions — e.g., those 
presented in — seem to us somewhat unclear. 

First of all, we note that the procedure involving Hscp 
can be seen as a measurement, where the role of the mea- 
suring apparatus is played by the classical coordinate r 
of the atom . The motion of this apparatus amounts to 
the separation of the gases. The above question can be 
thus reformulated as to concern other measuring appara- 
tuses (not connected with the coordinates) and their role 
for separation of the gases. Our answer to this question 
is negative, and here is why. 

Using the example given by (??), it is seen that 
there is not any measurement which would discrimi- 
nate unambiguously — and without disturbing the initial 
states — between |ai) and |a2), if (a2|ai) is neither zero 
nor one [2^. Thus, it is impossible to separate the gases 
without disturbing the states of their atoms. However, 
requiring cyclic changes of every single atom state is too 
much for a thermodynamical reasoning. It suffices to 
require cyclic change of all collective (macroscopic vari- 
ables) of the gases. In particular, the (final) internal 
states of the atoms in each reservoir are to be described 
by the density matrices pi = \ai){ai\ and p2 = |a2)(a2|, 
respectively. Such (generalized) measurements do exist 

. Assume for simplicity that the internal state is a spin- 
i represented by Pauli matrices a = (ai, a2, cts)- One 
comes with another set of particles carrying spin-i de- 
scribed by Pauli matrices s = (si, S2, S3). The spins a 
and s undergo a controlled unitary evolution, after which 
one measures, e.g., S3 with help of a suitable macro- 
scopic measurement apparatus. After selecting measure- 
ment results (i.e., the eigenvalues ±1 of S3), the initial 
mixed ensemble p = Ai|ai)(ai| -f A2|a2)(a2| of the tr spin 
is separated into two subensembles pi — |ai)(ai| and 
P2 = 1^2) (02 1; with the probabilities (weights) Ai and A2, 
respectively [2^|23. This is the desired separation. 

However, quantum measurements are by their very na- 
ture non-cyclic, since dissipative processes are connected 
with the motion of the pointer variable. In the above 
example both the spin s and the apparatus measuring 



A closely related quantum mechanical model for quantum (and 
classical) measurements was re cent ly analyzed in detail in col- 
laboration with R. Balian, Ref. l25ll 

This is a known fact in the physics of quantum ensembles; see, 
e.g., Ref. l26l . The described procedure amounts to POVM (pos- 
itive operator measured values). Recently we dis cuss ed in detail 
its implications for defining fluctuations of work l27l . 
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S3 have undergone such non-cyclic processes. These cer- 
tainly do generate an independent (and sizable) amount 
of entropy which is not taken into account in (??). 

• In summary, the possibility to define a cyclic pro- 
cess is a necessary condition for the thcrmodynam- 
ical meaning of entropy. When mixing gases that 
have non-orthogonal states, there are no suitable 
cyclic unmixing processes. This precludes entropy 
from having the proper thermodynamical meaning. 
Thus, trying to solve the problem in one place the 
quantum mixing entropy argument creates a new 
problem almost at the same time. The Gibbs para- 
dox thus remains unexplained. 



1. Why it is impossible simply to define entropy via the von 
Neumann formula? 

In the context of the above objection to the thermody- 
namical meaning of the mixing entropy in the quantum 
situation, one can ask why it is not possible simply to de- 
fine entropy via the von Neumann formula (??) without 
worrying on its precise relation to other thermodynamical 
notions. If desired, such a definition may be motivated, 
e.g., via information-theoretic arguments 0. 

In our opinion this is not possible to do, since entropy 
in statistical physics is never defined as an independent 
macroscopic observable; note again that the calculation 
of the von Neumann entropy via Eq. (??) requires the 
knowledge of the full spectrum of the density matrix p, 
which is microscopic information normally not available 
for statistical systems. For internal states, it may be 
available, though. More generally, entropy cannot be de- 
fined from first principles without taking into account 
the corresponding formulations of the second law of ther- 
modynamics, a fact that strictly speaking precludes any 
really non-circular derivation of these entropic formula- 
tions from first principles 0. In contrast, formulations 
of the second law that operate with work instead of en- 
tropy norma,lly do have first principle derivations; see, 
e-g-, |11,|2^|3^{. The non- unique character of entropy is 
recalled and illustrated in Appendix ??. 



B. The employed notion of "difference between 
gases" does not have a clear operational status. 

Another difficulty with the above argument is that this 
attempted resolution of the paradox does not depend on 
the available experimental instruments and tools to be 
employed in control of the internal states of the atoms. 
As it stands within the entropic argument, the resolution 
depends on the difference between the states which is de- 
termined by their initial preparations via density matri- 
ces pi and p2- However, preparation and control are dif- 
ferent things and in general cannot be combined in a sin- 
gle density matrix. As an example, consider preparation 



of a Gibbsian state with density matrix p oc exp[— _ff/T], 
where H and T are, respectively, the Hamiltonian and 
the temperature. This preparation needs only a weak 
interaction between the system and a thermal bath at 
temperature T; it does not contain any information on 
what we can measure or control in this system. 

On general grounds, it was argued in [23 that the res- 
olution of the Gibbs paradox has to be operational, since 
there are situations when two objects are formally dif- 
ferent, but no computable (i.e., solvable by algorithms) 
operation can establish this difference. Worse, we can- 
not exclude unknown laws of physics that in the future 
would force us to distinguish (states of) atoms or parti- 
cles which in our present understanding are considered 
as identical. 

This operational aspect is also important, because, 
as we see below for the approach that takes this 
properly into account, the dependence on the avail- 
able instruments is non-trivial: less refined instruments 
can — depending on the situation — indicate less or more 
irreversibility of mixing. 

One may perhaps counterargue the above criticism by 
noting that the operational meaning and the dependence 
on the available instruments might be provided by the 
information-theoretic approach to statistical physics; see, 
e-g-, [m 1131 ■ We, however, should simply note that 
information-theoretic constructions are not at all guaran- 
teed to have the proper physical meaning, as we saw for 
the above reversibility problem. Moreover, uncritical use 
of information-theoretic concepts may by itself lead to 
problems; see, e.g., [l^, where the first-principle deriva- 
tion of the Landauer bound for information erasure was 
found in conflict with the information-theoretical one. 



V. RESOLUTION OF THE PARADOX VIA THE 
CONCEPT OF MAXIMAL MIXING WORK. 

The main point of the present paper is to employ 
quantum thermodynamics — the thermodynamics of fi- 
nite systems coupled to a macroscopic worksource and 
possibl y to a macroscopic bath. As realized in earlier 
works IS 1^ J this approach generally acknowledges 
that one should study work instead of entropy - in the 
absence of a thermodynamic limit the latter has no firm 
meaning and each definition leads to a new value. This 
shift of paradigm will allow us to resolve the Gibbs para- 
dox without the difficulties and ill-defined meaning of 
the mixing entropy argument. The reason for this so- 
lution lies in the fact that work and its properties are 
deduced from the first principles of quantum mechanics 
without any need of thermodynamic postulates (such as 
reversibility or existence of cyclic processes); see in this 
context Footnotes ?? and ??. In other words, the resolu- 
tion of the paradox is sought by going to the first princi- 
ples of quantum mechanics alone, and without involving 
any thermodynamic argument. 

We start by recalling the definition of available work for 
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a general, thermally isolated process done on a quantum 
system. 



A. Definition of work. 

A quantum system is described at the initial time t = 
by a density matrix p{0) and interacts with an exter- 
nal macroscopic work source. The resulting evolution 
of the system is generated by (an effective) Hamiltonian 
H{t) = H{R{t)}, which is time-dependent via classical 
(c-number) parameters R{t) (control fields). 

We shall be concerned with processes where the change 
of the Hamiltonian is cyclic 



is identified with the average energy change of the 
system H 0] 



H{t) = H{0) = H. 



(25) 



The situation where the work-source interacts with the 
system for a finite time belongs to this class of processes, 
since the corresponding system-work-source interaction 
Hamiltonian is zero both initially and finally Note 
that processes with a cyclic Hamiltonian are obviously 
different from the processes that are cyclic in the sense of 
various macroscopic quantities. However, it is necessary 
to have a cyclic change of the Hamiltonian for the process 
to be cyclic in the sense of macroscopic quantities. 

Thus, the process is assumed to be thermally isolated 
and the Hamiltonian H(t) generates a unitary evolution: 



in-p{t) = [H{t),p{t)], 



!Xp 



AsH{s) 



(26) 
.(27) 



where fecp denotes the time-ordered exponent, ft is well 
known that, in general, a Hamiltonian evolution for two 
coupled systems does not reduce to a Hamiltonian evo- 
lution for one of them. However, in the present case the 
evolution of the system is Hamiltonian owing, in partic- 
ular, to the macroscopic character of the work-source, as 
discussed in 

• The work W done by the external source between 
times and t in the thermally isolated process ^'^ 



All constructions below generalize to processes, where the ini- 
tial and the final Hamiltonians are different. In the context of 
the Gibbs paradox this more general setting may provide some 
advantages, though it does not give any conceptual novelty as 
compared to the cyclic- Ham iltonian case. 

The appendix of Ref. 1321 contains a clear discussion of cer- 
tain additional conditions that have to be satisfied for the time- 
dependent Hamiltonian evolution and for the proper identifica- 
tion of the work-source. 

Prom the viewpoint of work-exchange every process can be com- 
pleted to a thermally isolated one by including in the system 
its environment (e.g., thermal baths). Then the work (??) for 
this thermally isolated process coincides with the usual definition 



W = tr[p{T)H{T)-p{Q)H{Q)] 



(28) 



• Due to conservation of [average] energy, W is equal 
to the average energy decrease of the work source. 

• This is a classical, mechanical energy that can be 
transferred with 100% efficiency to other macro- 
scopic work-sources, and, in particular, it can 
transferred to another mechanical degree of free- 
dom performing classical deterministic motion. 

• W is typically observed via suitable (classical) mea- 
surements done on the macroscopic work source, 
or, alternatively, by measuring the initial and fi- 
nal average energies on the ensemble of (many) 
identically prepared systems. Both these ways are 
routinely employed in practice, e.g., in NMR/ESR 
physics, where the system corresponds to spin-i 
under influence of external magnetic fields (work- 



• The definition of work and its features are based 
purely on the first principles of quantum mechanics. 
They do not depend on thermodynamical concepts, 
such as reversibility . In contrast, the work as it 
is known in thermodynamics can be deduced from 
the first principles of quantum mechanics. 



B. Maximally available work. 

One of the fundamental tasks of thermodynamics is 
to determine the maximal amount of work which can 



of work for an arbitrary process: W = dttv {pg{t) dtH(t)), 
where is the time-dependent density matrix of the sys- 

tem. Indeed, let He and Hj be, respectively, the Hamiltonian of 
the environment and the system-environment interaction. Re- 
call that the work-sources act only on the system; thus the 
total Hamiltonian W(t) of the system-|-environment is 'W(t) = 
H{t) + + Hj, where only the system Hamiltonian H{t) is 
time-dependent. To prove the desired statement we have to write 
down the expression (??); W = tr[ps_B(T) H(t) - pse{0) "^(0)], 
where psE{t) is the time-dependent density matrix of the sys- 
tem-|-environment, apply the von Neumann equation of motion 
for the thermally isolated process: ipSE = {H,Pse\^ a-^d trans- 
form = dttr{ps{t)atH{t)) = ^ dttr (psE{t)dtn{t)) with 
help of integration by parts. 

In particular, the definition of a reversible process can be based 
on the notion of work 1331 . A process is reversible if it can be 
supplemented by its mirror refiection that goes back along the 
same trajectory; ii) the work done on this completed process is 
zero. It is also clear that the definition of heat need not super- 
sede the definition of work. The reason for this is that from the 
viewpoint of work-exchange any process can be completed to a 
thermally isolated one, where the work is uniquely related to the 
energy; see Footnote ?? in this respect. 
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be extracted from a given (non-equilibrium) system in 
the initial state p under cyclic-Hamiltonian (sufficiently 
smooth) processes (??). The latter condition is imposed, 
since otherwise there may not be any limit in the ex- 
tracted work (e.g., for the final Hamiltonian being nega- 
tive and very large by the absolute value). It is via this 
task posed by Clausius and solved within phenomenolog- 
ical thermodynamics that entropy acquires its physical 
meaning as a measure of order related to high-graded en- 
ergy (work) din. While the standard solution of this 
task is well known and based on the notion of reversible 
process (in the same way as the definition of entropy is), 
it was recently shown that the problem can, and should, 
be solved from the first principles of quantum mechanics 
without invoking any thermodynamical axiom 
The solution differs from the standard one, though the 
latter provides a correct bound for the maximal work 
WKmax in (??) below, and is expected to agree with it 
when phenomenological thermodynamics is supposed to 
apply, i.e., for weakly non-equilibrium states of generic 
macroscopic systems. 

To describe the solution to the maximal work ex- 
traction problem, we denote the eigenresolutions of the 
Hamiltonian H and of the density matrix p as, respec- 
tively, 



H = Y,eu\ek){ekl P = (29) 

fe=l k=l 



where {\£k)}l=i and {\pk)}l=i with (efc|e;) = {pk\pi) = 
Ski are the eigenvectors of H and p, respectively, and 
where Sk and pk are the corresponding eigenvalues. We 
shall assume that always be ordered as 



£l<£2<.... (30) 



over all smooth, cyclic Hamiltonians 

H{t) = H + V{t), V{Q)^V{t)^0, (33) 

where r is the cycle time of the Hamiltonian Min- 
imizing over the Hamiltonians in (??) is equivalent to 
minimizing over all unitary operators U |3^ : this is why 
we denoted this minimization as min^. An explicit for- 
mula for the optimal Hamiltonian is given in 34], while 
the result of the minimization in (??) yields the ergotropy 

n 

W = tr [pH] - ^ pieu > 0. (34) 

k=l 

This is a difference between the final and initial average 
energies of the system, as it should be for the work ex- 
tracted in a thermally isolated system. It has a simple 
interpretation: since in quantum mechanics the eigenval- 
ues of p are conserved under the unitary evolution caused 
by macroscopic external sources, the lowest final energy 
is reached when the largest eigenvalue of p becomes the 
ground-state occupation, the one but largest eigenvalue 
occupies the first excited state and so on. Various fea- 
tures of W, in particular those contras ting the thermody- 
namical intuition, were studied in |32l l34j We suggested 
to call (??) the ergotropy of the state p. 

It is seen from (??) that no work extraction is possible 
(i.e., W = 0) if p is a monotonically decreasing function 
of H: 

p = fiH), rix)<0. (35) 

This, in particular, includes Gibbs equilibrium states 
p cx e^^/'^ , where T > is the temperature. This con- 
firms Thomson's formulation of the second law: no work 
extraction from an equilibrium state by means of cyclic- 
Hamiltonian processes . 



The non- increasing ordering of {|pfe)}fc^i is denoted as 

p[>pi>.... (31) 

Then the maximal available work is defined as [s^ 

W^niax = -W = miiiu{trH[p{T) - p(0)]}, (32) 

where W is the non-negative absolute value of the maxi- 
mal work, and where the minimization in min u is taken 



Note that for an n-level system the minimization over all Hamil- 
tonians (??) can be carried out by minimizing over Hamiltoni- 
ans of the form H(t) = 1^ + ^2'i—i ^ii^)-^i< whsre bi{t) are time- 
dependent c-functions, and where Xi are operators such that any 
generator of the group SU(n) can be obtained via linear combina- 
tions oi H, Xi , X2 , . . . , Xm and their multiple commutators .35] . 
For 71 = 2 and H = this Hamiltonian is H{t) = + b(t)ai, 
with (Ti and 0-3 being the corresponding Pauli matrices. For 
n = 3 the analogous Hamiltonian is H{t) = X-}+bi{t)Xi+b4(t)\4, 
where Afc are the Gell-Mann matrices [generators of SU(3)]. 
If the minimization is carried out via Hamiltonians A3 -|- fei(t)Ai , 
the unitary transformations act only on the upper left 2x2 sector 
of the 3x3 density matrix p. 

We note that there are no restrictions on the product of r with 
the typical magnitude of V{t) (i.e., on the dimensionless coupling 
constant characterizing the sources of work). It is also assumed 
that the initial state p is known. Limitations on this knowledge 
will, in general, lower the value of the maximal work. 
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C. The operational meaning of the available work. and translational degrees of freedom: the total Hamilto- 



The concept of maximal work takes into account the 
notion of available instruments. Indeed, in (??) we op- 
timized the extracted work over all cyclic-Hamiltonian 
thermally isolated processes, which assumes that the op- 
timal one is available. If there are restrictions on the 
availability of sources of work, the amount of extractable 
work will, in general, be smaller than W. It is even pos- 
sible that no work at all can be extracted by some re- 
stricted class of work sources 

To make this point clear, let us assume that the possi- 
ble unitary evolutions U in (??) are restricted to permu- 
tations of the diagonal elements 



TTfc 



(36) 



of the density matrix p in the energy representation 
Then instead of (??) we will have 



'W\p,H) ^^TT^efc-tr (pi/) (37) 



71 



fe=l 



In general, we have for the ergotropy 
W > W, 



(38) 



(39) 



where the equality sign is realized for [p, H] = Q. It is 
now possible that tt^ = tt^ and thus W' — 0, though 
yy > due to the non-diagonal elements of p. 



D. Explanation of the paradox. 

We shall now immediately deal with M gases with ar- 
bitrary weights \a = j^, and the total of particles 



M 



(40) 



Let us return to the assumptions presented in section 
IIII Al and list them again: i) The necessity of taking into 
account the internal states, ii) Decoupling of the internal 



The class of employed work-sources corresponds to what in Il5l 
was called a thermodynamical construction: a set of non-relaxed 
mechanical degrees of freedom that define the very meaning of 
various thermodynamical quantities. 

There are, of course, many other ways to introduce limitations 
on t he available unitary evolutions. For more examples, see 
Ref. [.Sfil . as well as, the last part of Footnote ?? and Footnote 
77 



nian H^"^} of each gas contained in the corresponding 
reservoir is 



rr(") _ TT 
■"tot - -'^O 



(a) 



No, 

E 



1,...,M. 



(41) 



where Hq is the sum of kinetic energies of all N gas par- 
ticles plus the potential generated by the walls of the 
reservoir, and where is the Hamiltonian of inter- 

nal motions of the atom with index i belonging to the gas 
with index a. Since we assume that all atoms in both 
reservoirs are identical and differ by their states only, 
we shall assume that all atoms have the same internal 
Hamiltonian: 



H. 



(42) 



Hi) Time-scale separation between the translational 
and internal degrees of freedom during the mixing; thus 
Eq. ©, p = ^aPa, holds for the post-mixed 

density-matrix for M gases with the initial internal states 
Pa and arbitrary weights Aq . 

In our opinion, these assumptions are physically sound; 
it is only their implementation within the mixing entropy 
argument that is problematic. We shall avoid that ar- 
gument by using work (more precisely, its maximum in 
absolute value, ergotropy) instead of entropy. 

Before mixing, how much work can be extracted from 
the total system containing M separate gases? The an- 
swer depends on the specification of the interaction be- 
tween the gases and the sources of work. These interac- 
tions are chosen under the following assumptions: 

(1) Since the gases are ideal, it is natural to assume 
that the sources act on each particle separately, i.e. the 
sources by themselves do not introduce interparticle in- 
teractions. 

(2) Work sources act on the internal degrees of freedom 
only. This is because the internal and the translational 
degrees of freedom are decoupled, and because the trans- 
lational degrees of freedom are in (local) equilibrium, so 
it is useless to try to extract any work from them; recall 
our discussion around (??). 

(3) We allow different sources of work to act on differ- 
ent gases. This is again reasonable, since the gases start 
out perfectly separated from each other. 

Given the above assumptions we are led to the follow- 
ing time-dependent, internal Hamiltonian for each gas 



N 
i=l 



(43) 



Since all particles within the given reservoir are equiva- 
lent, we have 



where 



F(")(0) = F(")(r) = 0, 



(44) 



(45) 
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as required by the cyclic-Hamiltonian feature [compare 
with (??)]. 

It is now seen that the maximal work extractable from 
the pre-mixed state reads 



M 



(46) 



where W{pa, H) defined in (??) is the maximal work ex- 
tracted from the initial state pa with the initial (and 
final) Hamiltonian H. Note that Wi is proportional to 
the total number of particles Af = J2a=i thanks to 
the above assumptions respecting the ideal gas structure 
of the problem. 

Let us now determine how much work we can extract 
after the M gases have mixed. The above conditions for 
system-work-source interaction remain valid except the 
last one: 

(3') Since the gases now form a single homogeneous 
system with the density matrix p — ^aPa, we can- 

not enforce the different particles (atoms) to couple to 
different sources of work. At best we can couple the 
Af — X)q=i particles with the same type of work 
sources. Thus, the physically acceptable cyclic Hamil- 
tonians has the form (??). 

The resulting maximal work reads from ??): 



(47) 



The difference between (??) and (??) is defined to be 
the maximal mixing work or mixing ergotropy AW: 



AW = Wi - Wf 



M 



M 



(48) 
(49) 

(50) 



k=l 



where we employed (??), and where p\, and p\, ^ are non- 
increasingly ordered eigenvalues of p and pa, respectively. 

The fact that maximal work cannot increase upon mix- 
ing, 



AW > 0, 



(51) 



should be obvious from the very construction. Here is, 
however, the formal proof. Recall (??) and note that 

maxc/„ (tr ] - tr [ff t/„ p„ L/^ ] ) (52) 
= tr[ffp„]-tr[ff{7„p„c7t] (53) 
> tv[Hpa]-tv[HU paU'^], (54) 

where Ua is the optimal unitary operator which maxi- 
mizes (??), and where U is any other unitary operator. 



including the one which maximizes tr [ Hp — HU pU"^]. 
The desired (??) is now recovered via multiplying (??- 
??) by Aq, and summing over a. 

The very same argument applies if the maximization 
in the definition of W is carried out over a restricted 
class of unitary operators or cyclic Hamiltonians (we as- 
sume, of course, that this is the same class initially and 
finally). Analogous to (??), we then deduce from (??) 
that AW' > 0. 

Turning to the conceptual implications of the mixing 
work AW, we note that, of course, AW — for pa — p, 
when identical gases are mixed. Moreover, it goes to zero 
continuously with pa — > P- 

• We therefore consider this continuity of maximally 
extractable work as the resolution of the Gibbs 
paradox within quantum thermodynamics. 

The first objection for the entropic argument — see our 
discussion around (??) and Ref. [lj| — is now harmless, 
since now the concept of thermodynamical reversibility 
is not employed anywhere; the machinery of the max- 
imal work-extraction is based completely on quantum 
mechanics alone. As we stressed repeatedly, work is a 
first-principle concept, more fundamental than entropy 

[ii] 20. 

Note that when pi and p2 are pure states, the converse 
of the above statement appears to be valid: if pi and p2 
are different, then AW > 0. This is because the only 
pure state that cannot provide work is the ground state 
of the Hamiltonian H . If, however, at least one of the 
two density matrices is mixed, there are different states 
pi and p2 such that AW — 0. For the simplest example 
recall (??) and take as pi and p2 two equilibrium states 
with different temperatures Ti and T2. 

To illustrate the above statements in more detail, we 
turn to the density matrices given by (??-??), where 
the Hamiltonian H has two energy levels and e > 0. 
Recalling (??) we get from (??) that AW is a simple 
function of the overlap: 



AW = :^[l-|(ai|a2) 



(55) 



For completely distinguishable, classical states 
|(ai|a2)| = this gives AW = e/2, while for iden- 
tical states |(ai|a2)| — 1, AW = 0. The classical 
argument describes only these extremes (i.e., completely 
different or identical) and, thus, creates the paradox. 



In phonomenological thermodynamics, the problem of the max- 
imal work extraction is treated by employing the reversibility 
concept and features of entropy [y. In our opinion, this is the 
reason why the concept of work — though ment ione d as a help- 
ful one for interpreting the Gibbs paradox ll,*!! — was never 
seriously employed for resolving the paradox. 
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VI. HOW THE MIXING WORK DEPENDS ON 
THE AVAILABLE INSTRUMENTS. 

Let us now turn to the second objection against the 
entropic argument. We recall from section ?? that once 
the difference between two states is recognized to be an 
operational notion — two states may not differ under in- 
spection by some instruments, but turn out to be differ- 
ent if more refined ones are used — we should expect that 
this feature is reflected in a satisfactory resolution of the 
Gibbs paradox. 

As we stressed repeatedly, the notion of available work 
is operational in the above sense. So is the mixing work 
defined in (??). Moreover, the situation is non-trivial, 
since AW can both increase or decrease under restrict- 
ing the available instruments (i.e. system-work-source 
interactions), as we show now. 

To illustrate this fact, let us take the internal Hilbert 
space of all particles having two dimensions (e.g., spin-i): 



1 



Pa 



(1 + 71q a) , 



1,...,M, 



(56) 



where t? — ( ui , (T2 , cts ) are the Pauli (2 x 2) matrices, 
and where 



= (»^l,a,«2,a,n3,a), |^a|<l 

is the Bloch c-vector. Recalling the spectrum 
Spec { Pa } = i (1 ± ImqI) , 



we get from (??), 



AW 




M 



On the other hand, if for the Hamiltonian 

^_ 5(1-1-^3) 



(57) 



(58) 



(59) 



(60) 



the maximization over the uninary operators in (??) is 
carried out only over those unitary operators which per- 
mute the diagonal elements of the corresponding density 
matrices in the energy representation [compare with (??, 
??)], the mixing work will read 



AW' = 



Ne 



M 



M 

E 

Q=l 



where n^. a is the third component of the vector n. 
It is obvious that there are cases where 

AW > AW', 



(61) 



(62) 



e.g., choose n^^a all having the same sign which leads to 
AW' = 0. It is, however, less expected that there can 
also be situations where 



AW' > AW. 



(63) 



• use of less precise instruments can increase the 
amount of mixing work. 



To show this, let us choose the case 

M 



0, 



(64) 



and write from (??, ??) 
AW' - AW 



N{e/2) 

M 



A 



In (??) we use the inequality^^ 



fx — \/x + y > — 



2Vi' 



(65) 
(66) 

(67) 



Taking for simplicity Xani ^ — ^01^2. a — b and 
|Ac[n3,Q,| = a — and thus M should be even to satisfy 
(??) — we get 



AW' - AW > V2MAf\b\ (^l - 



(68) 



By suitable choice of a and b, one can make the RHS of 
(??) positive, thus proving the desired statement (??). 



VII. MIXING WORK AND THE DEGREE OF 
MIXING. 

As we saw, the mixing work is zero when there is no 
true mixing, i.e., when the internal states of the mixed 
gases are identical It is expected that the mixing work 
will decrease together with the degree of mixing. 

Consider the mixing work AW(A) as a function of the 
weights A — {Xa}aLi- For fixed states {pa}a=iJ 
pect that if A is more inhomogeneous than fl, then 



AW (/I) > AW(A). 



(69) 



This means: 



To prove (??) make an incomplete Taylor expansion for f{x) = 
v^: fix + y) = fix) + yf'ix) + ^/"(C): where x<i<x + y, 
and disregard ^fH) < 0. 

Note that when the overall numbers of particles A^^ in each re- 
servior is not very large, even the mixing of completely identical 
gases brings about changes in their final state j9j . This is due to 
different fluctuation characteristics of the translational motion 
0, e.g., before mixing the number of particles in the volume Va 
is precisely Na , while after mixing this number of particles will 
fluctuate being equal to Na only on average. We shall neglect 
this effect assuming A^^ to be sufficiently large. 
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Here is an exaggerated example illustrating (??): for two 
species the degree of mixing is expected to be higher when 
having 100 particles of each type than when having 199 
and 1, respectively. The weights for this example are, 
respectively, /^i = /X2 = ^ and Ai = A2 = 

Below we clarify in which sense the intuitive expecta- 
tion (??) is correct. 



A. Majorization. 

First we need the proper formalization for the notion 
of "inhomogeneous" . This is provided by the concept of 
majorization [3^ which we shortly recall below. 

For two sets of probabilities A — {\a\^^i and \i — 
{/^ctliLi: A majorizes /i (i.e., A is more inhomogeneous 
than /I), denoted as 



if for all 1 < m < M 

m m 
Of— 1 a— 1 



(70) 



(71) 



where A''- means non-increasing ordering of A [recall (??)]• 
To illustrate (??): the uniform vector (1/A/, 1/M) 
is majorized by all other probability vectors, while any 
deterministic vector, e.g. (1,0..,0), majorizes all others. 
It follows from (??) that /(Aa) < Eali /(A^a) for 

any concave function /(x) '32|, e.g., ]{x) — —x\clx (en- 
tropy). 

The majorization property is transitive: ^ -< A and 
\ ^ V imply \1 ^ V. Also \ < and /2 -< A, imply 
\l = fj,l. However, this property is incomplete: for n > 3 
there are vectors A and fl for which neither A majorizes 
fl, nor does jl majorize A [3^. 



B. Quasi-classical situation. 

Let the initial states of the gases be M pure, orthonor- 
mal states 



Pa = \lpa){'ipa\, (V'alV'/?) = 



(72) 



We call this situation quasi-classical, since following 
the original formulation of the Gibbs paradox within clas- 
sical thermodynamics, the internal states are completely 
distinguishable and provide definite values for any ob- 
servable that has {\ipa)}aLi as its eigenfunctions. 

Let us now prove that if A is more inhomogeneous than 
jl, i.e., if (??) holds, then inequality (??) is valid. To this 
end we first employ summation by parts 



ekXi = en- (£2 - ei)Ai - (£3 - £2)(Ai + A^) 

- (£4-£3)(Ai + A^ + A^)-..., 



k=l 



and then recalling (??) we get 
AW{fl) - AW{X) 



k=l 



£fc [4 - Ai 



= (£2 - £i)(Ai - + (£3 - £2)(Ai + xi-fi{- 4) 

+ ...>0. (73) 

Here each separate term is non- negative due to (??, ??). 

• For this quasi-classical situation the above intuition 
(more mixing means larger mixing work) is correct. 

C. Quantum situation. 

Let us assume that the initial states {palaLi are not 
orthogonal. For simplicity we shall work with the sim- 
plest non-trivial situation: 



(74) 



i.e., two-dimensional internal state and two mixed gases. 
AW is now given by (??). We assume that A is more 
ordered than fl in the sense of majorization, which for 
M ~ 2 implies: 



Ai > A2 



Ml > M2, Ai > Ml. 



(75) 



Note that for the considered two-dimensional situation, 
n = 2, the majorization order coincides, e.g., with the 
entropic order: Eq. (??) imphes — AilnAi — A2lnA2 < 

-Ml In Ml -M2 lnM2- 

We now intend to clarify under which conditions the 
inequality (??) holds. Recalling (??) this inequality is 
equivalent to 



(Mi-Ai)(|ni|-|n2|) 



(76) 



> 



(MiI^iI + M2|^2| f - 2^i/i2|ni| |n2|(l - cost 



COS( 



- sj ( Ailnil + A2|n2| )' - 2AiA2|ni| |n2|(l ■ 
where cos 4> is defined as 

(77) 

1, inequality 



ni • 712 = "1 F2 COS( 



When both states are pure, \ni\ 
(??) reduces to M1M2 > A1A2 or 

Ai +Mi > 1, 



"2 



(78) 



a condition which is always satisfied in view of (??). 

Assume in (??) that (j) is small, and expand (??) to 
first order of 1 — cos^. After algebraic steps we get a 
generalization of (??) 



Ai + Ml > 1 + AiMi 



1 - 



l"i| 

l"2| 



(79) 



This inequality is already not always satisfied. When 
l'^i|/l"'2| is sufficiently small, i.e., one of the states is 
considerably more mixed, Eq. (??) may be violated; take, 
e.g., Ai = 0.8 and fj,i = 0.7. We conclude that 



13 



• In the quantum situation the mixing work may be 
a non-monotonous function of the degree of mix- 
ing, though it goes to zero continuously when the 
substances become identical. 



VIII. DISTINGUISHABILITY AND MIXING. 

Another way to control the mixing is to keep the 
weights equal, but make the internal states pi and p2 
closer to each other. It is natural to ask whether the 
mixing work is a monotonic function of the difference 
between these substances, i.e., whether decreasing this 
difference always makes the mixing work smaller. Below 
we are going to show that this is not always the case, 
though the mixing work, of course, goes to zero in the 
limit of identical substances. 

First of all we need a clear understanding of the proper 
distance (closeness) between two density matrices pi and 
P2- The answer is trivial for pure states as in (??): any 
monotonic function of the overlap 



tr(pip2) = I(ai|a2)r 



(80) 



can be taken as the proper degree of closeness. 

The generalization of the overlap (??) to mixed states 
is also well known and was derived from several different 
perspectives [s^ls^. This "distinguishability" reads: 



d{pi,P2) = 



tr 



1/2 1/2 
Pi P2P1 



(81) 



Let us note that d{pi,p2) is symmetric 

d{pi,P2) ^ d{p2,pi), (82) 

concave 

d{p, xpi + (1 - x)p2) > xd{p, pi) + (1 - x)d{p, P2), (83) 

and varies between and 1, 

0<d(pi,P2) < 1, (84) 

being equal to 1 if and only if pi — P2- It is also multi- 
plicative 

d{pi (gi /93, P2 fXi Pi) = d{p2,pi)d{p3,p4:), (85) 

invariant under unitary transformations, 

d{pi,p2)=d{U piU\U p2U^), U^U = l, (86) 

it increases under completely positive evolution, and re- 
duces to tr ( P1P2 ) if pi or p2 is pure. 

In particular, d{pi,p2) has the proper information- 
theoretic meaning as arising from the statistical distance 
between the data acquired by optimal measurements car- 
ried out for distinguishing between pi and p2 38]. 



In Appendix ?? we determine d{pi, P2) for two spin ^ 
density matrices pi and p2, given as in (??), with Bloch 
vectors ni and n2, respectively: 



2d{pi ,P2)-l = ni-n2 + k^iP Vl- l^2p. (87) 

For pure states \na\ — 1, and we expectedly obtain from 
(??) propotionality between the overlap and the scalar 
product of the two Bloch vectors. 

For the mixing work we have from (??) 

AW £ 

-Jj^ = 2 +A2|^2| 



A^|ni|2 + \l\n2\'^ + 2A1A2 ni ■ ri2 



When comparing (??) with (??) we see that if only 
the scalar product fii ■ n2 is varied — with the modules 
I nil and \n2\ being fixed — making the two states closer, 
the mixing work AW indeed monotonically decreases. In 
particular, this is the case for pure states pi and p2. How- 
ever, as seen from (??), for mixed states pi and p2 the 
scalar product between the corresponding Bloch vectors 
is only one aspect of closeness. To look at another setup, 
vary with \n2\ while keeping their mutual angle 
fixed (see (??) for the definition of 0). Note that AW 
always increases with |ni|: 

9AW e 



d\ni\ 2 |Ai?li -I- A2n2| 

(|Aini + A2n2| - Ai|ni| - A2|n2| cos0) > 0. (89) 

On the other hand, we have from (??) 

dd{pi,p2) _ 1 
d\ni\ " 2^1-|ni|2 

(cos0|n2| v/1- kiiP - l"i| v/l-|«2P) (90) 

When the scalar product is positive: ni • n2 = cos > 0, 
Eq. (??) can be positive, i.e., the states can get closer 
with increasing |ni | , if |ni | is sufficiently small, or if |n2 1 is 
sufficiently close to 1. Comparing with (??) we conclude: 

• It is possible to make the two states of the mixing 
substances closer to each other and simultaneously 
increase the mixing work 

We stress that all conclusions of the present section 
are valid under other reasonable measures of distance be- 
tween pi and p2, e.g., tr [(pi — ^2)^] ■ Indeed, it amounts 
to a simple check that the qualitative conclusion we got 
after (??) is valid as well for this measure of closeness. 



Note the differ enc e with 2tr (pip2) — 1 = n\ ■ n2. 
^'^ Note that Ref. Il6l discusses a similar situation in classical chem- 
ical physics. The analog y, h owever, appears to be superficial, 
since the author of Ref. [ifij bases his conclusions on the non- 
additive classical formula S^iiN, V) = NlnV for entropy. 
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IX. CONCLUSION. 

Since its formulation in the late 1870's, the Gibbs para- 
dox has, lacking a simple solution, become a quest for 
the understanding of phenomenological thermodynam- 
ics from a more fundamental theory. This attempt to 
go to a deeper level is the reason for its importance 
P, IE IE B I3- Its understanding happens to have 
several layers. First, it was realized that it is necessary 
to take into account explicitly the difference between the 
particles, which drives the classical formulation of the 
paradox, but how much they differ shows up nowhere 
in formulas. Together with the separation of character- 
istic relaxation times and the von Neumann definition 
of entropy, this brought about the quantum mixing en- 
tropy argument which for many years was seen as the 
resolution of the Gibbs paradox |y, 0, IE IE It was, 
however, pointed out that the argument introduces a new 
conceptual difficulty precisely when it claims to solve the 
paradox The details being presented in section ITTll 
we simply recall that this difficulty has to do with the 
features of entropy, more precisely, with the fact that the 
entropy is not a sufficiently primitive (first-order) quan- 
tity in the situation at hand. So a deeper reduction level 
has to be involved for the resolution of the Gibbs para- 
dox. 

In our opinion, the basic reason why classical ther- 
modynamics fails for the understanding of mixing en- 
tropy is that the difference between an A atom and a B 
atom is not dealt with properly, in particular, because 
no macroscopic limit is involved in differences between 
gases A and B. Lacking such a limit, the basis for phe- 
nomenological thermodynamics, be it based on classical 
or quantum statistical physics, has disappeared and its 
application indeed leads to paradoxes and ill-defined is- 
sues such as the non-operational nature of the mixing 
entropy. We are thus left with the search for a more 
fundamental approach. Such a possibility is offered by 
the field of quantum thermodynamics, that has been 
considered in recent yea rs by several groups, see e.g. 

EE liE 111 El IM IM M IM IM EE IMM eS El 

In the current paper we have presented an explanation 
of the Gibbs paradox within this field. Here the notion of 
entropy is known to be easily blurrred, and a paradigm 
shift is called for towards the more ancient concept of 
work [energy transferred to macroscopic work sources], 
which still plays a clear and empirical role. In particu- 
lar, quantum thermodynamics applies to finite systems, 
e.g., the basic formulations of the second law are well- 
defined both conceptually and operationally (30j . Indeed, 
it could be shown that Landauer's principle that connects 
the minimal energy dispersion to erase one bit of informa- 
tion AQ > kT In 2 may loose its validity in the domain of 
quantum thermodynamics [l^ , while the Maxwell demon 
problem just found new viewpoints there |22ll45j| . 

It was further shown that the maximally extractable 
work (which we called 'ergotropy' in an earlier paper with 
R. Balian (33|) can be clearly defined before and after 



the mixing process. The difference between them defines 
the maximal mixing work, or mixing ergotropy, a non- 
negative quantity which smoothly goes to zero when the 
substances become more and more equal to each other, as 
it is for a single substance, thus solving the Gibbs para- 
dox in the work formulation. (As should be clear from 
our presentation, we consider that the Gibbs paradox in 
its entropic formulation has not been properly solved so 
far, and that we even do not believe that it is consistently 
resolvable in that form.) 

In contrast to entropy, the features of work can be 
directly based on the first principles of quantum me- 
chanics and are well-defined for any (equilibrium or non- 
equilibrium) state of a system interacting with macro- 
scopic sources of work. In particular, there is no need 
to involve features of thermodynamical reversibility for 
defining and interpreting the mixing work; see in this con- 
text Footnotes ?? and ??. On top of that, the concept of 
maximal work has a well-defined operational character, 
because it is always defined with respect to a definite class 
of work-sources acting on the system of interest. The fea- 
tures of work and entropy are recalled and contrasted in 
sections ??, ?? and Appendix ??. Recall in this context 
that the concept of work was already employed in the 
literature devoted to the Gibbs paradox [E [lEl j but its 
potential applications were conceived in the framework 
of phenomenological thermodynamics. In that way, they 
encounter almost all objections raised against the mix- 
ing entropy argument. Only after the problem of maxi- 
mal work-extraction was solved from the first principles 
|34| , it became possible to approach an explanation of the 
Gibbs paradox with the help of the mixing work. This 
explanation is free of the difficulties which plagued the 
quantum entropy argument. 

To keep our approach as natural as possible, we have 
supposed that, after allowing the gases to mix, the trans- 
lational degrees of freedom equilibrate rather quickly, 
while their spin degrees of freedom do not equilibrate 
at all at the timescales for which our discussion applies 
because their dynamics are supposed to take place on a 
much larger time scale. For this reason, these degrees of 
freedom can be considered as not coupled to the bath, 
which saves us from discussing the more complicated sit- 
uation where heat exchange of the spins with the bath 
would also matter. 

The consistent resolution of the paradox presents fea- 
tures that might not have been anticipated before. It 
appears that less precise control can, depending on the 
situation, bring a larger or a smaller amount of mixing 
work. We have also seen that a naive intuition relat- 
ing the degree of mixing and the distinguishability with 
the mixing ergotropy may not always be correct: some- 
times making the initial states of the mixed substances 
closer to each other (in the proper information-theoretic 
sense) can make the amount of mixing work larger. These 
are warnings against a direct association of physical ir- 
reversibility (i.e., mixing work) with lack of information: 
while the amount of mixing work is non-zero due to less 
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information on the identity of atoms in the post-mixed 
state, the relation of this lack of information to the physi- 
cal irreversibility can be non-trivial and counter-intuitive. 

Acknowledgments 

It is a pleasure to thank Roger Balian for inspiring dis- 
cussions and Peter Keefe for carefully reading the proofs. 



A.E. A. acknowledges hospitality at the University 
of Amsterdam. His work was partially supported by 
the Stichting voor Fundamenteel Onderzoek der Materie 
(FOM, financially supported by the Nederlandse Organ- 
isatie voor Wetenschappelijk Onderzoek (NWO)) and by 
CRDF grant ARP2-2647-YE-05. 



[1] L.D. Landau and E.M. Lifshitz, Statistical Physics, I, 
(Pergamon Press Oxford, 1978). 

[2] R. Balian, From Microphysics to Macrophysics, volume 
I, (Springer, 1992). 

[3] J.W. Gibbs, On the Equilibrium of Heterogeneous Sub- 
stances, (Connecticut Acad. Sci., 1875-1878); Selected 
Papers on Thermodynamics and Statistical Physics, 
(Cambridge University Press, London, 1951). 

[4] E. Schrodinger, Statistical Thermodynamics, (Cambridge 
University Press, London, 1952). 

[5] I.E. Tamm, Usp. Fiz. Nauk, 6, 112, (1926). 

[6] A. Lande, New Foundations of Quantum Mechanics, 
(Cambridge University Press, Cambridge, 1965); Foun- 
dations of Quantum Theory, (Yale University Press, 
Yale, 1955). 

[7] M.J. Klein, Am. J. Phys., 26, 80 (1958). 

[8] V.L. Luboshitz and M.I. Podgoretskii, Sov. Phys. Usp., 
14, 662, (1972). 

[9] Ya.M. Gelfer, V.L. Luboshitz and M.I. Podgoretskii, 
Gibbs Paradox and Identity of Particles in Quantum Me- 
chanics, (Nauka, Moscow, 1975) (in Russian). 
[10] A.M. Lesk, J. Phys. A, 13, Llll (1980). 
[11] Yu.S. Varshavskii and A.B. Sheinin, Dokl. Acad. Nauk 

USSR, 148, 1099 (1963). 
[12] N.G. van Kampen, The Gibbs Paradox, in Essays in The- 
oretical Physics in Honor of Dirk ter Haax, ed. by W.E. 

Parry (Pergamon, Oxford, 1 984). 

[13] C.Y. Tseng and A. Caticha, 'cond-mat /0109324 
[14] D. Dieks and V. van Dijk, Am. J. Phys., 56, 430 (1988). 
[15] L.A. Blumenfeld and A.Yu. Grosberg, Biophysics, 40, 
653 (1995). 

[16] Shu-Kun Lin, Electronical Journal of Theoretical Chem- 
istry, 1, 135 (1996). 

[17] E.T. Jaynes, The Gibbs Paradox, in Maximum Entropy 
and Bayesian Methods, ed. by C. R. Smith, G. J. Erick- 
son and P. O. Neudorfer (Kluwer, Dordrecht, 1992). 

[18] A.E. AUahverdyan, R. Balian and Th.M. Nieuwenhuizen, 
J. Mod. Opt. 51 2703 (2004) 

[19] A.E. AUahverdyan and Th.M. Nieuwenhuizen, Phys. 
Rev. E, 64, 056117 (2001). 

[20] P.A. Corning, S.J. Kline, Systems Research and Behav- 
ioral Science, 15, 273 (1998). 

[21] E. Schrodinger What is life?, (Cambridge University 
Press, Cambridge, 1944). 

[22] M.O. Scully, Y. Rostovtsev, Z. Sariyanni, and M.S. 
Zubairy, Physica E 29, 29 (2005); Y. Rostovtsev, Z. 
Sariyanni, M.S. Zubairy and M.O. Scully, ibid, pp 40; Z. 
Sariyanni, Y. Rostovtsev, M.S. Zubairy and M.O. Scully, 
ibid, pp 47. 

[23] R. Rosen, Phil. Science, 31, 232 (1964). 



[24] A. Peres, Quantum Theory: Concepts and Methods, 

(Kluwer Academic Publishers, 1995). 
[25] A.E. AUahverdyan, R. Balian and Th.M. Nieuwenhuizen, 

Europhys. Lett., 61, 452 (2003); cond-mat /04083 16 
[26] E. Schrodinger, Proc. Cambridge Philos. Soc. 32, 446 

(1936); 

L. P. Hughston, R. Jozsa, and W. K. Wootters, Phys. 

Lett. A 183, 14 (1993). 
[27] A. E. AUahverdyan and Th.M. Nieuwenhuizen, Phys. 

Rev. E 71, 066102 (2005) 
[28] G.N. Bochkov and Yu.E. Kuzovlev, Sov. Phys. JETP, 

45, 125 (1977). 

I.M. Bassett, Phys. Rev. A 18, 2356 (1978). 

W. Pusz and S.L. Woronowicz, Comm. Math. Phys., 58, 

273 (1978). 

A. Lenard, J. Stat. Phys., 19, 575 (1978). 

W. Thirring, A Course in Mathematical Physics 4- 

Quantum mechanics of large systems, (Springer, Vienna, 

1983). 

H. Tasaki, |conJ- mat/0009244 
J. Kurchan, cond-mat/0007360 

A.E. AUahverdyan and Th.M. Nieuwenhuizen, Physica 
A 305, 542 (2002). 

[29] G. Lindblad, Non-Equilibrium Entropy and Irreversibil- 
ity, (D. Reidel, Dordrecht, 1983). 

[30] A. E. AUahverdyan and Th. M. Nieuwenhuizen, Phys. 
Rev. E, 71, 046107 (2005). 

[31] A.E. AUahverdyan and Th.M. Nieuwenhuizen, Phys. 
Rev. Lett. 85, 1799 (2000); Th.M. Nieuwenhuizen and 
A.E. AUahverdyan, Phys. Rev. E 66, 036102 (2002). 

[32] D. Janzing, to appear in J. Stat. Phys.; 
quant-ph/0502019 

[33] P. Perrot, A to Z of Thermodynamics, (Oxford Univer- 
sity Press, 1998). 

[34] A.E. AUahverdyan, R. Balian and Th.M. Nieuwenhuizen, 
Europhys. Lett. 66, 419 (2004); cond-mat/04 01574 

[35] V. Jurdjevic and H.J. Sussmann, J. Diflt. Eqn., 12, 313 
(1972). 

A.G. Butkovskiy and Yu.I. Samoilenko, Control of Quan- 
tum Mechanical Processes and Systems, (Kluwer Aca- 
demic, Dordrecht, 1990). 
[36] D. Janzing, quant-ph/0511271 

[37] A.W. Marshall and I. Olkin, Inequalities; Theory of Ma- 
jorization and its Applications, (Academic Press, New 
York, 1979). 

[38] S.L. Braunstein and CM. Caves, Phys. Rev. Lett., 72, 
3439 (1994). 

C.A. Fuchs and CM. Caves, Open Systems & Informa- 
tion Dynamics 3, 1 (1995); quant-ph/9604001 
[39] D.J.C. Bures, Trans. Am. Math. Soc, 135, 199 (1969). 



16 



A. Uhlmann, Rep. Math. Phys., 9, 273 (1976). 
M. Hiibner, Phys. Lett. A, 163, 239 (1992). 
R. Jozsa, J. Mod. Opt., 41, 2315 (1994). 
[40] M.O. Scully, Phys. Rev. Lett. 87, 220601 (2001); ibid. 
88, 050602 (2002). 

Y. V. Rostovtsev, A.B. Matsko, N. Nayak, M.S. Zubairy, 
and M.O. Scully, Phys. Rev. A 67, 053811 (2003). 
A.E. HiU, Yu.V. Rostovtsev, and M.O. Scully, Phys. Rev. 
A 72, 043802 (2005). 
[41] T. Feldmann and R. Kosloff, Phys. Rev. E 61, 4774 

(2000) . 

R. Kosloff and T. Feldmann, Phys. Rev. E 65, 055102(R) 

(2002). 

Y. Rezek and R. Kosloff, |quant-ph/0601006| 
J.P. Palao, R . KosIoIJ aM 7m. Gordon, 
"quant-ph/0106048l 
[42] J. Gemmer, A. Otte, and G. Mahler Phys. Rev. Lett. 86, 
1927 (2001). 

J. Gemmer and G. Mahler, Eur. Phys. J. D, 17, 385 

(2001) ; ibid. 31, 249 (2003). 

J. Gemmer, M. Michel and G. Mahler, Quantum Ther- 
modynamics, vol. 657 of Lecture Notes in Physics 
(Springer, Berlin, 2004). 

[43] A.E. AUahverdyan, R. Serral Gracia, and Th.M. 
Nieuwenhuizen, Phys. Rev. Lett. 93, 260404 (2004); 
Phys. Rev. E 71, 046106 (2005). 

[44] D. Janzing, P. Wocjan, R. Zeier, R. Geiss, and Th. Beth. 
Int. Jour. Theor. Phys. 39, 2217 (2000). 

[45] Th.M. Nieuwenhuizen and A.E. AUahverdyan, Unmask- 
ing Maxwell's Demon, in: Quantum Limits to the Second 
Law, AIP Conf. Proc. Vol. 643 (2002), pp. 436. 

[46] R.C. Tolman, The Principles of Statistical Mechanics, 
(Oxford University Press, London, 1938). 



APPENDIX A: THE FORMULATION OF THE 
SECOND LAW SETS THE CHOICE OF THE 
ENTROPY 

Here we sliall recall why entropy is not uniquely defined 
and why its possible definitions depend on the second law 
of thermodynamics. 

Consider an adiabatically isolated process done on a 
quantum system described by density matrix p{t). The 
process is realized via a time-dependent Hamiltonian, 
with the cyclic feature defined according to Eq. (??). The 
evolution of the system starts from some Gibbsian equi- 
librium state at a positive temperature. It is well known 
from thermodynamics P, Q and can be derived from the 
first principles of quantum mechanics — see and 
our discussion in section ?? — that in this process the sys- 
tem consumes positive work which is the statement of 
the second law in Thomson's formulation. It is natural 
to look for the counterpart of this formulation in terms 
of entropy. Since the dynamics of the thermally isolated 
system is unitary, the von Neumann entropy >5'vN(p(i)) is 
constant in time; so it is not suitable for being the coun- 
terpart of the Thomson's formulation. This argument 
is sometimes dismissed on the ground that the unitary 
dynamics is reversible and thus the constant behavior of 
the von Neumann entropy is reasonable. In the present 



context this seems incorrect, in particular, because a pos- 
itive amount of work is put into the system in accordance 
with Thomson's formulation of the second law. 

In the spirit of the relevant entropy approach 0] (there 
are many entropies each one for its own situation and its 
own use) we can regard as physical another entropy 

Srit) ^ -J2^kit)lmrkit), (91) 

k 

with TTfe being the time-dependent probabilities of var- 
ious values of the system's energy (given by the time- 
dependent Hamiltonian) in the state p{t). This defini- 
tion of entropy was proposed and advocated by Tolman 
j46j . For the considered process, St does have several 
reasonable properties: 

(1) At the end of the cyclic-Hamiltonian process St is 
larger than in the initial equilibrium state |28l ;.29J . 

(2) Under conditions specified in js^l, the change of 
St is minimal for the adiabatically slow process, again 
as required by thermodynamics. 

(3) S't is maximal in equilibrium. 

Each of these three features corresponds to a specific 
formulation of the second law. The features (1) and (2) 
will not be valid when using the von Neumann entropy. 

Thus, we are led to employ the Tolman definition of 
entropy following to the requirements of the second law. 

Let us now consider an isothermal process, where the 
system (e.g., a spin or a brownian particle) weakly inter- 
acts with an equilibrium thermal bath at temperature T. 
The bath being in equilibrium means for the present con- 
text two things. First, it starts in the equilibrium state 
at temperature T, and, second, its relevant characteris- 
tic times are much larger than those of the system. (An 
additional feature of weak interactions was stressed by 
us above.) It is again well known from phenomenological 
thermodynamics, and is derived from the first principles 
of quantum mechanics that during the relaxation of the 
system to equilibrium, the (non-equilibrium) free energy 
decays, a statement known as H-theorem [H 13. l29|: 

^ = ±[Eit)-TSMt)] 

= ^ [tr {p{t) H) + Ttr {p{t) In p{t) )] < 0, (92) 

where p{t) is the density matrix of the system, and where 
H is its time-independent Hamiltonian. Note especially 
that the H-theorem will in general not be valid if instead 
of the von Neumann entropy we shall use in (??) the 
Tolman entropy S't. Thus, here for isothermal processes 
we had to return to the von Neumann definition of en- 
tropy. What is the proper definition of entropy when the 
process is neither isothermal nor thermally isolated is in 
general not known |3ll |. 

In short, in statistical physics the definitions of entropy 
are contextual, since they already depend on various for- 
mulations of the second law. It is, therefore, questionable 
whether arguments based on entropies are able to resolve 
thermodynamical paradoxes. 
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APPENDIX B: A USEFUL IDENTITY 

Here we outline how to calculate the overlap defined 
in (??) for two spin i density matrices 

p= i(l+ri„-a), a =1,2. (93) 

We need the following facts. First, note that the square 
root of p is most conveniently calculated when represent- 
ing p as 



and 



P=\ (1 +sin6'x- o') , 



(94) 



where < < Tr/2, and where x is a unit vector \x\ = 1. 
Then 



(95) 



\/^= V 2 ( cos - +sin-x- ^ ) , 



tTy/p = V2cos-^ ^^ + ^/^^W■ (96) 

Next, we need the known identity for Pauli matrices 

(ni • a) {n2 ■ a) = {Hi ■ n2) + ia ■ [fii x 712], (97) 

where [ni x n2 ] is the vector product. And, finally, the 
last ingredient is given by 

(X-^) (f^) {x-^)={'i{x-i)x-i)-^, (98) 

where x and ^ are unit vectors. Eq. (??) follows from 
(??) and the double vector product identity: 

[na X [ni X 712 ] ] = rzi (n2 • ns) - n2 (ni • m). (99) 



